A Modular Architecture for Separating Hypothesis Formation from Hypothesis Evaluation in Data-driven Machine Translation
Abstract
Recent research in statistical and example-based machine translation integrates rule-induced structured representations with statistics and lexicalised exceptions. While rule-based approaches concentrate on the formation of partial translation hypotheses, probabilistic approaches are concerned with the evaluation and selection of the best hypotheses. Within the METIS-II framework, we propose a machine translation system which uses transfer and expander rules to build an AND/OR graph of partial translations and a statistical ranker to find the best path through the graph. The paper gives an overview of the architecture and an evaluation of the system for several languages.
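The split between hypothesis formation and hypothesis evaluation can be illustrated with a minimal sketch. It is not the METIS-II implementation: the node classes (`Leaf`, `And`, `Or`), the functions `expand`, `score` and `best_translation`, and the toy bigram table are all hypothetical names and values chosen for illustration. The graph is assumed to have been produced already by transfer and expander rules, and the ranker is assumed to be a simple bigram scorer that enumerates every path exhaustively, which a real system would likely replace with a pruned search.

```python
from dataclasses import dataclass
from itertools import product
from typing import Iterator, Tuple, Union


@dataclass(frozen=True)
class Leaf:
    word: str  # a single target-language word


@dataclass(frozen=True)
class And:
    parts: Tuple["Node", ...]  # all parts are needed, in this order


@dataclass(frozen=True)
class Or:
    options: Tuple["Node", ...]  # competing partial translations


Node = Union[Leaf, And, Or]


def expand(node: Node) -> Iterator[Tuple[str, ...]]:
    """Hypothesis formation: yield every word sequence the graph licenses."""
    if isinstance(node, Leaf):
        yield (node.word,)
    elif isinstance(node, Or):
        for option in node.options:
            yield from expand(option)
    else:  # And node: combine one reading of each part, left to right
        for combo in product(*(list(expand(p)) for p in node.parts)):
            yield tuple(word for seq in combo for word in seq)


# Toy bigram log-probabilities (made-up numbers, illustration only).
BIGRAM_LOGPROB = {
    ("<s>", "the"): -0.5,
    ("the", "house"): -1.0,
    ("the", "home"): -2.5,
    ("house", "is"): -0.7,
    ("home", "is"): -0.9,
    ("is", "big"): -1.0,
    ("is", "large"): -1.6,
}
UNSEEN = -10.0  # assumed penalty for bigrams missing from the table


def score(words: Tuple[str, ...]) -> float:
    """Hypothesis evaluation: sum bigram log-probabilities over the sequence."""
    padded = ("<s>",) + tuple(words)
    return sum(BIGRAM_LOGPROB.get(pair, UNSEEN) for pair in zip(padded, padded[1:]))


def best_translation(graph: Node) -> Tuple[str, ...]:
    """Pick the highest-scoring path through the AND/OR graph."""
    return max(expand(graph), key=score)


# A hand-built graph standing in for the output of the rule component.
graph = And((
    Leaf("the"),
    Or((Leaf("house"), Leaf("home"))),
    Leaf("is"),
    Or((Leaf("big"), Leaf("large"))),
))

print(" ".join(best_translation(graph)))
```

In this sketch the rule side only decides which alternatives exist, and the ranker only decides which of them wins, so either component could be swapped without touching the other.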
Publication date: 2007